Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
freelancer.com π’ 2026-05-30
πΉ Multi-Source Real Estate Data Pipeline (Venezuela)
π€ Client: πΊπΈ Miami, United States Member since 2012-11-28
π° Price: $1064 Average bid
π© Problem: Need for a weekly normalized market intelligence dataset from 8 disparate property portals to track valuations and agent activity across three regional offices.
π¦ Existing: Apify account, AWS PostgreSQL database, internal AWS developer for Lambda deployment.
Specifications:
[Target] 8 Venezuelan property portals (MercadoLibre, Rivinotinto, MLS Caracas, RE/MAX, Rent-a-House, Century21, TuHome24, PERAIG)
[Method] Hierarchy: Internal API β Plain HTML β Headless Browser (Playwright)
[Method] Two-stage extraction: List pages (URLs/Cards) β Detail pages (Attributes/Agents)
[Method] Incremental fetching: Price checks via list pages; full fetch only for new/changed listings
[UI/UX] Internal search tool for agent attribution
[Stack] TypeScript, Crawlee, Apify Actors, AWS Lambda, PostgreSQL (JSONB)
[Security] Residential proxies (MercadoLibre/PERAIG), DataDome bypass
[Format] Normalized schema: properties, portals, agencies, agents, price_history, scrape_runs
[Format] No image assets; URL and structured data only
Workflow:
1. Deploy Apify Actor with domain-specific adapters and router.
2. Execute weekly scrape using method hierarchy (API > HTML > Browser).
3. Normalize data by office (RM I/II/III) and geographic state.
4. Push data via Apify Dataset β Webhook β AWS Lambda.
5. Upsert to PostgreSQL with change detection for price history tracking.
6. Trigger Slack alerts for adapter failures or data drops.